Integrating genomic information and productivity and climate-adaptability traits into a regional white spruce breeding program

Tree improvement programs often focus on improving productivity-related traits; however, under present climate change scenarios, climate change-related (adaptive) traits should also be incorporated into such programs. Therefore, quantifying the genetic variation and correlations among productivity and adaptability traits, and the importance of genotype by environment interactions, including defense compounds involved in biotic and abiotic resistance, is essential for selecting parents for the production of resilient and sustainable forests. Here, we estimated quantitative genetic parameters for 15 growth, wood quality, drought resilience, and monoterpene traits for Picea glauca (Moench) Voss (white spruce). We sampled 1,540 trees from three open-pollinated progeny trials, genotyped with 467,224 SNP markers using genotyping-by-sequencing (GBS). We used the pedigree and SNP information to calculate, respectively, the average numerator and genomic relationship matrices, and univariate and multivariate individual-tree models to obtain estimates of (co)variance components. With few site-specific exceptions, all traits examined were under genetic control. Overall, higher heritability estimates were derived from the genomic- than their counterpart pedigree-based relationship matrix. Selection for height, generally, improved diameter and water use efficiency, but decreased wood density, microfibril angle, and drought resistance. Genome-based correlations between traits reaffirmed the pedigree-based correlations for most trait pairs. High and positive genetic correlations between sites were observed (average 0.68), except for those pairs involving the highest elevation, warmer, and moister site, specifically for growth and microfibril angle. These results illustrate the advantage of using genomic information jointly with productivity and adaptability traits, and defense compounds to enhance tree breeding selection for changing climate.


Introduction
White spruce (Picea glauca (Moench) Voss) is one of the most widely distributed North American conifer species and commercially, one of the most important tree species in the Province of Alberta (Canada) [1]. To date, most forest tree' quantitative genetic studies and tree improvement programs are primarily focused on economically important productivity traits (productivity-related traits), such as growth and wood quality (e.g., [2][3][4]). However, the ongoing rapid climate change resulting in higher frequency and severity of drought events has begun to change the focus of selection. In addition to directly affecting tree productivity, drought can have a profound effect on tree susceptibility to pests and pathogens [5]. Therefore, climate change-related (adaptive) traits including plasticity and adaptation to drought, forest pest and pathogens resistance should be incorporated into existing tree breeding programs [6,7]. Aligning with this recommendation, and reviewing 260 global tree pest and disease resistance initiatives, Yanchuk and Allard [8] reported very few tree improvement programs that operationally succeeded in deploying resistant material. Moreover, for a better understanding of the interplay between productivity-and adaptability-related traits, breeders need to study which secondary compounds are associated with these traits and understand their inherent variation.
In the context of global climate change, knowledge of traits' variance components and their genetic parameters such as heritability and correlations between productivity-, adaptabilityrelated traits, and chemical compounds related to defense and drought stress, are vital for the development of effective tree breeding programs. Moreover, either the simultaneous maximization/optimization of potential genetic gain for multiple traits, or understanding the genetic × environment (G×E) interaction from multiple site analyses, are essential to increasing tree resilience toward environmental perturbations, and for ensuring the sustainable longterm genetic progress of a breeding program [9]. Several studies have reported pedigree-based (see below) genetic parameters for productivity-related traits, such as growth and wood quality in white spruce [3,4,[10][11][12][13][14][15][16] as well as pest resistance traits [17][18][19][20]. However, few studies have focused on drought resilience [21] and defense chemical traits [22]. Therefore, the genetic control, cross-site stability (i.e., G×E), and correlation of most of these adaptability-related traits remain to be understood.
To obtain precise genetic parameter estimates (or function of them), accurate information of individuals' genealogy is required [23]. The individual-tree mixed model utilizes individuals' contemporary pedigree information to estimate the additive genetic variance using Henderson's average numerator relationship matrix (A-matrix) [24]. However, the A-matrix estimates ignore all historical relationships beyond that of the contemporary pedigree as all relationships are based on identity-by-descent rather than actual relationships [25]. Thus, the accuracy of genetic parameter and predicted breeding value rankings are compromised [26]. On the other hand, the use of genomic information through molecular markers to infer the realized genomic relationship matrix (G-matrix; [27]) offers an efficient alternative to constructing the additive relationship matrix, and effectively estimating individuals' realized genetic relatedness. Recently, studies in forest trees have tested the value of molecular markers for estimating genetic parameters using the G-matrix [2-4, 10, 28, 29]. However, these studies only focused on growth and/or wood quality traits and limited work examined drought resilience, and/or pest resistance via chemical defense traits [20,30]. As part of a large-scale tree genomic study [31] we genotyped 1,540 white spruce trees with 467,224 SNPs and phenotyped them for various productivity-, adaptability-related traits, and defense monoterpenes. These trees represent a subset of open-pollinated progeny being tested and grown on multiple genetic test sites throughout the Province of Alberta [31]. The available genotypic and phenotypic information for these trees offered a unique opportunity to evaluate the genetic control and relationships of the assessed traits, and the extent of G×E interactions. Here, we studied 15 growth, wood quality, drought resilience, and defense and drought stress chemical traits (monoterpenes), and estimated their quantitative genetic parameters (including heritability and genetic correlations) within and across-sites. Estimates were obtained and compared using both pedigree-and genomic-based relationship matrices. The results of this study are expected to provide critical information needed for the identification and selection of genetic material for their inclusion in new production populations (seed orchards). New second generation orchards will replace the aging first generation orchards which currently supply 65% of all white spruce reforestation stock in Alberta (Andy Benowicz, personal communication). There is an urgent need to change the orchard production profiles from the current ones focused on improved growth only, to the ones focused on improved climate resiliency.

Genetic material and trial description
Three open-pollinated (OP) progeny trials (Calling Lake: CALL, Carson Lake: CARS, and Red Earth: REDE) of the Alberta Agriculture and Forestry white spruce Region D1 breeding program [32] were used in this study (Table 1 and Fig 1). These trials were planted in a randomized complete block design with six replicates and 5-or 6-tree row plots at 2.5× 2.5 m spacing ( Table 1). The entire population being tested in the three progeny trials consisted of 150 families from 10 provenances. Based on age-30 tree height, a sub-sample of 80 families were selected representing low-, average-and high-class heights, each with approximately eight individual progeny per family for CALL and REDE, and four progeny for CARS (n = 1,483). An additional 142 potential forward selected trees, previously identified in the three progeny trials and based on height breeding values, were also included for sequencing. From these 142 forward selected trees, 34 trees were from an additional 19 families, resulting in a total of 1,625 trees from 99 families.

Traits evaluated
Diameter at breast height (1.3 m; DBH) and tree height (HT) were measured at age-30 and represent the growth productivity traits measured. Wood density (WD) was measured using a 5 mm bark to pith increment cores taken close to breast height on the north facing side of each tree. Cores were transported in straws to the lab, soxhlet extracted overnight with hot acetone, precision cut to 1.68 mm thickness with a twin blade pneumatic saw, and allowed to acclimate to 7% moisture before density analysis. All samples were then scanned from pith to bark by Xray densitometry (Quintek Measurement Systems, TN) at a resolution of 0.0254 mm. We report data as relative density on an oven-dry weight basis. Finally, average WD was calculated as the weighted WD of the individual tree rings weighted by their annual basal area increment (BAI) to better represent the density of the whole tree. Given that juvenile rings have less reliable measurements we discarded tree rings prior to 1995. Microfibril angle (MFA) was determined by X-ray diffraction by determining the 002 diffraction arc (T-values) using a Bruker D8 Discover X-ray diffraction unit equipped with an area array detector (GADDS) on the radial face of the individual growth rings, as previously detailed by Ukrainetz et al. [36].
Two dendrochronological indices were calculated from tree ring information: drought resistance (Resistance) and mean drought sensitivity (Sensitivity). Resistance represents the ability of a tree to maintain growth during a specific drought episode, in this case it occurred in 2015, and was calculated using the following equation [37]: Resistance = BAI drought /BAI pre−drought , where BAI drought is the average BAI of the drought event (2015) and BAI pre−drought is the average BAI of the four years before the drought event (2011-2014) (see S1 Fig). Resistance describes how much the incremental growth is reduced during a drought event. As such, a Resistance value close to 1 represents a tree unaffected by the drought, while smaller values represent less resistant trees. Sensitivity is a classic dendrochronological index commonly used to estimate the responsiveness of trees to climate [38], and was calculated as: where, BAI t is the BAI measured at year t and n is the total number of years measured. Trees with high climatic sensitivity are able to grow particularly well under good environmental conditions but are more severely affected by drought events. The two residual outside pieces of the cores (slabs), retained during pneumatic processing of the density specimens along the increment cores radial direction of the cross section, were used to capture the variation in the stable carbon isotope ratio (δ 13 C) across all years measured on each tree. The slabs were dried and ground using a Qiagen TissueLyser II (Qiagen Inc., Hilden, Germany). During grinding, each sample was placed into an individual stainless-steel jar with a 2 cm stainless-steel ball. The ground samples were then analyzed for δ 13 C at Alberta InnoTech Stable Isotope Laboratory, Victoria, Canada. The analysis was performed using an established method on a MAT 253 mass spectrometer with Conflo IV interface (Thermo Fisher Scientific, Waltham, MA, USA.) and a Fisons NA1500 EA (Fisons Instruments, Milano, Italy). In brief, approximately 1 mg of solid sample was weighed into tin capsules then placed into a combustion reactor that produces CO 2 , which was then analyzed by mass spectrometry for isotopic estimates. Multiple in-house standards, calibrated relative to international standards, were run to allow the results to be normalized and reported vs. Vienna Pee Dee Belemnite. δ 13 C values were used as a measure of intrinsic long-term water use efficiency (WUE).
The defense compounds identified and quantified were mainly monoterpenes assessed from needles collected from south facing branches near the crown of the trees during May-June (2017), and from the 99 families selected and studied across all three test sites (n = 1,602) (see S1 Text "Chemical analysis" for details). Briefly, needle samples were kept at -40˚C and ground to a powder for extraction. Hexane-extractable compounds were identified and quantified with a gas chromatography-flame ionization detector using methods modified from [39]. We identified 12 hexane-extractable compounds and used seven monoterpenes (αpinene, β-pinene, camphene, myrcene, limonene, terpinolene, and camphor), including the sum of all hexane-extractable compound concentrations (total monoterpenes), in the characterization of genetic parameters. Many of these compounds can be anti-feedants for Choristoneura fumiferana (eastern spruce budworm; [40][41][42]). The remaining chemical compounds did not fit model assumptions and were not included in the analyses.
Logarithmic transformations were applied to MFA and all monoterpene compounds to improve data normality (see S2 Fig). Additionally, prior to the multivariate analyses, all the phenotypic data were spatially adjusted [43] using the design effects. Design adjusted phenotypic data were obtained for each tree for each trait and site by subtracting the estimated replication effects from the original phenotype. Finally, data of all traits were standardized (mean = zero and variance = 1). The list of traits, number of trees for each trait, and summary statistics for all the phenotypic traits in their original scale (i.e., without design adjustment) are presented in Table 2.

Genotyping-by-sequencing
Following Chen´s et al. [44] genotyping-by-sequencing (GBS) protocol, the DNA from each needle sample was prepared with EcoT22-I (ATGCA) restriction enzyme digestion. Sequencing reads of 1,625 trees were aligned to the most up-to-date white spruce assembly (WS77111-v2, [45]) using BWA [46] and TASSEL-GBS [47]. Of the total 30 million SNP read tags constructed,~26 million tags (87.5%) were aligned to the genome assembly and 4.5 million SNPs were determined with an individual site depth at 4x coverage. A set of 1,599 trees and 467,224 (467K) biallelic SNPs were obtained based on filtering the SNP data set for a maximum missing data proportion of 30%, a minor allele count of one, and maximum site read depth < = 70. Missing data were imputed using the mean observed allele at each locus.

Pedigree correction
Using the filtered SNP subset, we validated and corrected the pedigree of the OP families based on the comparison of the expected versus observed additive genetic relationships using a custom R-script. Samples' pairwise additive relationship coefficients of the G-matrix (see below) were examined for large deviations from their expected values (e.g. 0.25 for half-sib) and corrected parentage was assigned or reassigned manually. We removed 59 sampled trees for parent conflicts. Of the final set of 1,540 trees, 202 trees' pedigree records were modified or corrected. These changes mostly stemmed from the identification of 5 phantom mothers and 100 pollen donors (fathers), which increased the number of identified parents for the 1,540 white spruce trees from 99 (original pedigree) to 204 (corrected pedigree). The number of genotyped trees per mother had a range of 1-20, and from 1 to 8 per site.

Quantitative genetics analyses
Our single-trait single-site analysis used a univariate individual-tree mixed model as following: where, y is the vector of phenotypic data; β is the vector of fixed effects genetic groups formed according to provenances; d is the vector of random design effects, including replications, however, given that in general just one RES-FOR trees was sampled from each 4-tree row plot, the plot effects were not fitted; a is the vector of random genetic effects following a normal distribution with zero mean and (co)variance matrix As 2 a , where A is the average numerator relationship matrix and s 2 a is the additive genetic variance; and e is the vector of the random residual effect following also a normal distribution with zero mean and (co)variance matrix Is 2 e , where I is the identity matrix and s 2 e is the residual error variance. X, Z d , and Z a , are incidence matrices relating fixed and random effects to measurements in vector y.
Genetic correlations between different traits measured from the same individual, and genetic correlations between sites, considering measurements from different sites as different traits, were estimated based on the following multiple-trait individual-tree mixed model: where, ½y 0 i j � � � jy 0 j � included the individual-tree spatially adjusted phenotypes for all traits and sites; the genetic groups effects for each trait or site are included in ½β 0 i j � � � jβ 0 j �; the genetic effects (breeding values) of all individuals for all the traits or sites are included in ½a 0 i j � � � ja 0 j �, and ½e 0 i j � � � je 0 j � is the residual vector. The incidence matrices X i �� � �� X j , and Z a i � � � � � Z a j related observations in ½y 0 i j � � � jy 0 j � to elements of ½β 0 i j � � � jβ 0 j � and ½a 0 i j � � � ja 0 j �, respectively. The symbols L and ' indicates the direct sum of matrices and transpose operation, respectively. Finally, the expected value and variance-covariance matrix of the genetic effects in model (2) are respectively equal to:  where, s 2 a ii and s 2 a jj are the genetic variances for the traits or sites i and j respectively, s a ij is the genetic covariance between traits or sites i and j, and A is defined above for the single-trait single-site model. The symbol � indicates the Kronecker products of matrices. The expected value and variance-covariance matrix of e are equal to: The residual variances for traits or sites i and j were s 2 e i , and s 2 e j , respectively, s e ij is the residual covariance between traits i and j, and I is the identity matrix. Given that the sites were assessed separately, the residual covariances across-sites were assumed to be zero.
In the genomic-based approach, the pedigree-based relationship matrices A (A-matrix) for genetic effects, of the previous mixed models (1) and (2), were substituted by the corresponding genomic relationship matrix (G-matrix) based on 467K SNPs.
where, W is the n × m (n = number of individuals, m = number of SNPs) rescaled genotype matrix following M-P, where M is the genotype matrix containing genotypes coded as 0, 1, and 2 according to the number of alternative alleles, and P is a vector of twice the allelic frequency, p i . Estimates of pedigree-and genomic-based variances for the genetic effects (ŝ 2 a ,) and residual errors (ŝ 2 e ), were re-parameterized to individual-trait narrow-sense heritability (ĥ 2 ) and genetic correlations (r a ) between traits, or sites i and j, as follows: ;r a ¼ŝ a i;j ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffi ffî s 2 a i;i �ŝ 2 a j;j q Visualization of genetic correlations between traits was done using the corrplot function in Rpackage corrplot [48]. Correlations between traits or sites were considered strong ifr a � 0.70, moderate if 0.70 >r a > 0.40, and low or weak whenr a � 0.4. Univariate model (1) and multivariate model (2) were fitted in R (www.r-project.org) with the function remlf90 from the package 'breedR' [49], using the Expectation-Maximization (EM) algorithm followed by one iteration with the Average Information (AI) algorithm to compute the approximated standard errors of the variance components [50]. The remlf90 function in the R-package 'breedR' is based in the REMLF90 (for the EM algorithm) and AIR-EMLF90 (for the AI algorithm) of the BLUPF90 family [51]. The program preGSf90, also from the BLUPF90 family [51], was used to create the inverse of the G-matrices calculated with the 467K SNPs markers, and then used to fit models (1) and (2) with the 'breedR' package.

Pedigree-and genomic-based relationship estimations
To study the expected (pedigree) and realized (genomic) relationship structures in the genotyped population, individual pairwise relatedness was estimated using either genome-wide marker data or pedigree (after correction) to determine the proportion of self-relationship (1.00 relatedness), full-sibs (0.50), half-sibs (0.25), and unrelated (0.00) individuals. For the 1,540 genotyped trees, we determined a total of 2,371,600 pairwise relationships. After pedigree correction, the value distribution showed that 98.81% (2,343,490) of which involved estimates for unrelated individuals (according to the pedigree), while half-sibs represented 1.11% (26,210) and full-sibs 0.02% (360) ( Table 3). A comparison of the pedigree expected and genomic realized relationship matrices is also depicted using the distribution of the number of pairwise additive relationships (S3 Fig). A good pedigree control in the production of the unrelated, half-sib and full-sib families is shown, although SNP marker data, by capturing the realized genetic relationships, provided considerably more refined estimates of the continuous distribution of true relatedness in the genotyped population.

Heritability estimates
Overall, narrow-sense heritability estimates based on genomic relationship matrices were generally (35 out of 42 site-trait combinations) higher than those based on the pedigree relationship matrices (average of 0.54 and 0.43 across traits and sites, respectively; Fig 2). However, standard errors for heritability estimates were found to be lower for the pedigree-(0.16 averaged across traits and sites) versus genomic-based (0.19) models (Table 4).
Across test sites and relationship matrices, heritability estimates for growth traits (HT and DBH) ranged from low to high with an average estimate of 0.73 (range: 0.06-0.97). Wood quality traits (WD and MFA) showed low to moderate narrow-sense heritability estimates, averaging 0.34 (range: 0.05-0.78). Among the test sites, CARS showed significantly lower heritability estimates for DBH and WD, and the lowest MFA heritability estimate was found at the REDE. Both dendrochronological drought indices, Resistance and Sensitivity, showed moderate to high heritability estimates for CALL and REDE with values ranging from 0.25 to 0.80 (average 0.49). However, these values were near zero for CARS, i.e., with no heritable variation (additive genetic variation). For the trait δ 13 C, moderate to high heritability estimates were found with values ranging from 0.57 to 0.98 (average 0.81). Heritability estimates for monoterpene compounds, however, showed a lack of consistency, with values ranging from 0.00 to 0.96 (averaged of 0.45). Total monoterpenes showed slightly lower heritability estimates than the individual monoterpenes, ranging from 0.08 to 0.64 (average 0.39). Again, CARS showed lower heritability estimates than the other white spruce test sites for total monoterpenes (see Table 4 for details).

Traits genetic correlations
Overall, genomic-based relationship genotypic correlation estimates are equivalent to those from the classical pedigree-based relationship with a similar average (of 0.23) across the 105 trait-pair combinations; and varied from -0.81 to 0.99 and -0.79 to 1.00, for pedigree-and genomic-based genetic correlation estimates, respectively (Fig 3 and S1 Table). However, Table 3. Statistics of pairwise relatedness coefficients. Statistics of pairwise relatedness coefficients for self-relationship coefficients, full-sibs, and half-sibs and unrelated genotyped trees, for both the pedigree (after pedigree correction A-matrix) and genomic information from all available SNPs (467K) (G-matrix). Across test sites and relationship matrices, estimates of genetic correlations between DBH and HT were consistently high and positive, ranging between 0.87 and 0.93 (average 0.90). Low to moderate negative or positive correlations were apparent between growth and wood quality traits (WD and MFA) (range: -0.50-0.33), with some inconsistency across sites especially between growth traits and MFA. Generally, consistently low to moderate negative correlations between growth variables and growth resistance (Resistance) were found within and across sites (range: -0.18 --0.65). However, correlations between growth and Sensitivity traits were high and positive (range: 0.40-0.78) for CALL and REDE, and negative (range: -0.31 --0.08) for CARS. The correlation between growth traits and δ 13 C varied from 0.20 to 0.70. Genetic correlation estimates between growth traits and monoterpene compounds and total monoterpenes were low to moderate. For CALL, correlation coefficients were mostly positive (range: -0.11-0.28), whereas in CARS and REDE low and negative correlations were generally found (range: 0.08 --0.49) (Fig 3 and S1 Table). The genetic correlations between the two wood quality traits (WD and MFA) were consistently negative across sites (range: -0.10 --0.42). Negative correlations were also identified between WD and the drought indices (-0.08 to -0.26 for Resistance, and -0.15 to 0.15 for Sensitivity). In contrast to WD, MFA showed strong and positive correlation values with Resistance (range: 0.39-0.77), while the genetic correlation between MFA and Sensitivity remained low to moderate, and negative (range -0.45-0.19). Genetic correlations between WD and monoterpene compounds and total monoterpenes were generally low and positive, meanwhile MFA also showed generally low but both positive and negative genetic correlation estimates with the various monoterpene compounds.

A-matrix G-matrix A-matrix G-matrix A-matrix G-matrix
The adaptability-related Resistance trait showed a low negative correlation with δ 13 C (range: -0.37 --0.01) and in general positive correlation with Sensitivity (range: -0.01-0.24). Further, the correlations between the two drought resistance traits varied across sites. For example, the genetic correlations between Resistance and Sensitivity averaged across the two relationship matrices were, -0.80 for REDE, -0.14 for CALL and, with an important variation across the two relationship matrices, 0.19 for CARS. Resistance showed statistically significant low and negative correlations with monoterpenes for CALL, low to moderate positive correlation for CARS, and was low but statistically not significant for REDE. For the Sensitivity and monoterpene associations, strong positive genetic correlations were found for CALL (range: 0.33-0.63), while in REDE, these correlations were mostly non-significant (range: -0.12-0.09, Fig 3 and S1 Table). Correlation estimates between δ 13 C values and monoterpene compounds and total monoterpenes also varied across sites, with low and negative values for CARS and positive relationships in the remaining sites, although statistically non-significant with relatively large standard errors. Finally, the genetic correlation estimates between monoterpene compounds (including total monoterpenes) were generally moderate to strong, positive and consistent across sites (Fig 3 and S1 Table). Table 4. Estimated narrow-sense heritability and their approximate standard error (SE), for each growth, wood quality, drought resilience and chemical traits in the white spruce population. Heritability estimates were estimated using the pedigree-(A-matrix) and genomic-based (G-matrix) relationship matrices constructed from all available SNPs (467K). Abbreviations used for the traits and sites are described, respectively, in the text and Table 1.  Table). Overall, genetic correlations between sites were positive with relatively small standard errors. However, inconsistency was observed, potentially reflecting the climatic conditions between CARS and the other two sites (CALL and REDE) (see Table 1 and discussion below). While average genetic correlation estimates across traits and relationship matrices were strong for the CALL and REDE pair (0.76), the lowest correlations were obtained between the sites CALL and CARS (0.48) and REDE and CARS (0.52), in particular for the growth and MFA traits (Fig 4 and S2 Table).  For the growth traits (HT and DBH), the average across pedigree-and genomic-based relationship estimates of genetic correlations between pairs of sites was moderate (0.44 and 0.53, respectively). However, as we mentioned above, these correlations were inconsistent for the pairs of sites that involved CARS, with significantly lower and imprecise (relatively large standard errors) site-to-site genetic correlations. For wood quality traits, genetic correlations among sites were high and consistent across pairs of sites for WD (average 0.98, range 0.97-0.98), while estimates for MFA across sites showed some degree of variability (average 0.66, range: 0.37-0.92) with the lowest correlations (and largest standard errors) also for the pairs involving CARS.

A-matrix G-matrix A-matrix G-matrix
For the adaptability-related drought indices, the genetic correlation among sites for Resistance and Sensitivity, ranged from -0.02 to 0.93, but in general, these estimates were associated with relatively large standard errors, except for Sensitivity between CALL and REDE. Furthermore, significant positive genetic correlations for the WUE related isotopic δ 13 C values were found across sites and the two relationship matrices studied (average 0.92, range: 0.80-0.97, Fig 4 and S2 Table).
Genetic correlations for the monoterpene compounds among sites were positive, and ranged from moderate to strong with an average of 0.70 (range: 0.15-0.94). Potentially owing to the smaller sample size (n < 1,183, Table 2), the standard errors of the genetic correlations for β-pinene, camphor, and terpinolene between CALL and REDE were larger. Moreover, CARS was not included in these multiple-site analyses of these three compounds as there was insufficient phenotypic data (n < 30) available. For total monoterpenes, genetic correlations among sites were positive and strong, and consistent across sites, with an average across the two relationship matrices of 0.90 (range: 0.86-0.96).

Discussion
Considerable effort have been committed to quantitative genetic analyses of several tree species' growth and wood quality productivity-related traits. While the need to identify adaptability-related trait genotypes grows, less effort has been directed towards the selection of pest and disease resistant trees, and even less for the selection of drought resistant/resilient individuals. Here, we provide a comprehensive quantitative pedigree and genomic analyses of growth, wood quality, drought resilience, and monoterpene traits in a white spruce breeding population. Accurate estimates of narrow-sense heritability and genetic correlation estimates among traits within and across-sites were obtained and are expected to provide valuable information to breed and assist in the selection of resistant/resilient genetic material for increasing productivity and adaptability of future white spruce forests.

Trait genetic control
Genetic parameters and their function, such as heritability and correlations, play an important role in the selection of parents in a breeding program. However, these values are context dependent, as they depend on the relative contributions of genetic and environmental variations in a specific population, and vary among traits and across measurement ages [52]. While height (HT) heritability estimates (Table 4) showed values somewhat higher than those reported in earlier white spruce studies [10,14,16], likely as a result of unintentional sampling artifacts, heritability estimates for diameter (DBH) are consistent with earlier observations in other forest tree species [10,53]. Although wood density (WD) heritability estimates were comparable to those reported earlier, the pedigree-and genomic-based relationships produced variable results, similar to earlier observations [3,4,14]. Microfibril angle (MFA) showed low to moderate genetic control, consistent with results from other white spruce studies [4].
Recent quantitative studies in conifers using population [54], family structure [21], or genomic [55][56][57] information have used tree-ring traits, such as the short-term index resistance, to analyze the genetic variation and genetic architecture in drought responses. Here, we studied two short-term indices (i.e., Resistance and Sensitivity) and both produced low to moderate heritability estimates, results similar to those reported by Depardieu et al. [21] in white spruce at a single-site. However, some variation across sites was observed, at CARS for Resistance and Sensitivity, results similar to those reported by Zas et al. [54] who quantified the genetic variation of resilience and resistance indices in two different sites located in central Spain subjected to similar drought events (intensity, timing and duration) in maritime pine (Pinus pinaster Ait.). Zas et al. [54] indicated that differences between sites in response to extreme drought events should not be attributed to differences associated with the extreme event itself, but to other microenvironmental factors such as topography, soil depth and stoniness that existed between sites. CARS is the highest elevation site with higher summer precipitation and lower summer temperatures, when compared to the other two white spruce test sites (Table 1). It is therefore plausible that trees in CARS were not exposed to equivalent or severe drought conditions compared to the other two sites to express differences in Resistance and Sensitivity over the same time period (2011)(2012)(2013)(2014)(2015).
Resistance to stress is often difficult to measure and depends on a complex network of functional traits at multiple scales [58]. In trees, stable carbon isotope ratio (δ 13 C) values can be used as an index of integrated long-term water use efficiency (WUE), expressing the ratio of carbon fixed to water lost as related to stomatal function. Moreover, δ 13 C may serve as a guide for parental selection decisions for seed production, to identify genotypes with contrasting growing strategies, elucidating the underlying mechanisms of complex physiological traits [59], or selecting genotypes for high WUE without compromising yield [60]. In our study, we showed that there is significant potential for selection using δ 13 C information, as the genetic variation in δ 13 C was moderate to high, and comparable to earlier reports (Johnsen et al. Maximizing growth in future climate scenarios with increased pest activity and drought events requires an understanding of the natural variability of quantitative resistance to disease [65] and drought tolerance. In a review on conifers, Kopaczyk et al. [66] indicated that plant secondary metabolites such as terpenes are not involved in vital processes, but may be essential for some conifers to adapt to unfavourable abiotic conditions such as drought stress by increasing levels of constitutive defenses. For instance, total monoterpenes increased significantly in Pinus sylvestris (39%) and Picea abies (35%) trees under a severe drought relative to that of the control [67]. When Picea abies was subjected to water stress, the contents of tricyclene, α-pinene, and camphene were significantly higher than the control trees [68]. Therefore, trees showing resistance to insect attacks or drought events can produce higher levels of secondary chemicals compared with trees susceptible to insects or non-drought stressed trees. In spite of the importance of these compounds in relation to adaptability-related traits, few studies have focused on the genetic control (i.e., heritability estimates) of secondary compounds in forest trees. Hanover [69] reported heritability estimates of five monoterpenes (four of which are included in our study) in Pinus monticola ranging from 0.38 to 0.95, with heritability values all within the ranges obtained in our study.
Overall, our results showed that estimates of heritability using the genomic relationship matrix from 467K SNP markers were greater than those estimated using the pedigree relationship matrix (average across traits and sites 0.54 vs. 0.43, respectively; Fig 2), demonstrating that the genetic variance captured depended on whether a pedigree-or genomic-based relationship matrix was used. These results agree with those reported by Tan et al. [70], in Eucalyptus, where heritability estimates obtained from genomic information were higher than those from the pedigree, for both growth and wood quality traits. In contrast, Lenz et al. [71] and Gamal El-Dien et al. [72] found heritability estimates from the genomic relationship matrix lower than those estimated from the pedigree relationship matrix for growth and wood quality traits in Pice mariana and Picea glauca × Picea engelmannii, respectively. However, similar heritability estimates from pedigree and genomic information were obtained for HT in Picea abies [73] and HT and MFA in lodgepole pine [29]. These results highlight the differences in genetic parameter estimates that exist between different relationship matrices. The cause of these differences may be attributable to different causes, like different data sets or noise due to uncertainty in the estimates [74]. Interestingly, differences in genetic variance estimates may also exist as a consequence of the fact that pedigree-and genomic-based relationships matrices refer to different base populations, where genomic relationship matrices reflects the genotyped population whereas the pedigree relationships reflects the founders of the pedigreed population [74].

Relationship among traits
Trait genetic correlations are important for demonstrating their associated genetic responses (how selection on one trait affects the mean and potentially genetic variation in another). This is particularly important for breeders to better understand the interplay between the productivity-related and/or adaptability-related traits. Although higher genetic correlations were observed among growth traits (i.e., DBH and HT) (Fig 3 and S1 Table), such correlation values indicate that selection for any one of these traits alone would give a high correlated response in the other traits, providing an opportunity to efficiently allocate assessment efforts. Our results confirmed previous observations in white spruce [75] and other conifer species [36, 76,77]. For example, Rweyongeza [75] using progeny trials from the same white spruce series studied here, reported DBH-HT genetic correlation estimates of 0.76 to 0.94 (average 0.85) for age-20 and 30 measurements.
The reported genetic correlations suggest that the selection for rapid growth could result in a small decrease in WD (Fig 3 and S1 Table). Earlier studies in several tree species have shown that genetic correlations between growth traits and WD are negatively correlated, but may also vary with environmental factors (e.g., location, site conditions) [78]. Moreover, different results concerning the relationships between growth rate and WD may be expected, given that WD is a complex trait influenced by many factors [79]. For instance, either negative [13], or no/minor and negative [14] genetic correlation relationships were reported for WD and HT in white spruce. Our results also showed that the genetic correlation between growth traits and MFA depended on the site, as the genetic correlations were low to moderate, as well as negative or positive. A low and negative correlation (-0.31) between MFA and HT was obtained by Park et al. [13] in white spruce; but high and positive or negative correlations (0.71 and -0.52 were reported for MFA and DBH) [80] or moderate correlations (0.40 and 0.39 at age 10 and 25, respectively) [81] in Norway spruce.
Unfavourable results were obtained for the relationship between growth traits and the short-term index Resistance; therefore, selection of larger trees (greater in height and diameter) could result in a decrease in resistance to drought under climate change. Mean drought sensitivity (Sensitivity) also showed an unfavourable relationship with growth traits in CALL and REDE but favourable in CARS, highlighting that differences in relationships can be associated with local environmental conditions [54] (see discussion in the following sub-section). Trujillo-Moya et al. [55] showed that drought resistance was found to be positively correlated with mean annual increment in a 35-year-old Norway spruce provenance test, however, Montwé et al. [82] also showed some contrasting results depending on the origin of the climatic regions from which 35 lodgepole pine provenances where selected. Montwé et al. [82] also found a trade-off between tolerance to drought and growth only for the most southern (U.S. A.) lodgepole pine population, while the central and southern interior British Columbia (Canada) populations showed an ability to tolerate drought and to maintain comparatively good long-term growth.
Our results showed that faster-growing trees were positively correlated with higher WUE (higher δ 13 C values) and this association was strong (Fig 3 and S1 Table). Therefore, these results suggest that δ 13 C is a useful criterion for selecting fast growing genotypes with higher WUE. The positive genetic correlation between growth and WUE could arise from several mechanisms. First, the genetic variation in WUE might be driven by the variation in carbon assimilation rate, which in turn, was positively correlated with growth [63]. An alternative interpretation could suggest that the genetic variation in WUE was driven by the variation in stomatal conductance, and taller trees might have lower stomatal conductance due to hydraulic constraint, as found in Pinus pinaster (maritime pine) [83]. However, further research is needed to elucidate if our δ 13 C findings were driven by the genetic variation in assimilation rate or by stomatal conductance in the studied population, and to explore the causes of the stronger association found between δ 13 C and growth traits. Previous studies evaluating field trials did not show the existence of a general trend between growth traits and WUE [61,63,[84][85][86]. For instance, in Picea mariana, negative [61] and positive [63] correlations between growth and δ 13 C were found.
Wood characteristics have been suggested as screening traits for drought sensitivity to identify drought tolerant individuals [87,88]. Denser wood is typically associated with xylem that is more resistant to hydraulic failure [89]. Our analysis generally showed some unfavourable relationship between WD and Resistance (negative and low correlation), suggesting that average WD values could be a poor predictors of mean drought sensitivity and thus other physiological parameters may be required. Sebastian-Azcona et al. [90] found no differences in cavitation resistance between different provenances of white spruce, which also suggests that other traits such as root water uptake or stomata regulation might have a stronger effect on the inherent differences to drought resistance. George et al. [91], in the genus Abies, found that the average ring density had either a negative relationship to resistance or positive relationships to recovery, resilience and relative resilience, as well as no or only weak correlations with different drought events. Other physical properties of wood structure such as MFA may also provide information about tree sensitivity to drought events. Our results suggest that higher MFA values are associated with more drought tolerant trees (higher values of Resistance; positive correlation). Higher MFA may enable the tracheid to bear higher hoop stresses when a tracheid is under high tension given greater resistance against cell collapse during drought events [92]. However, changes in MFA as a reaction to the environment are still poorly understood [92]. In summary, various wood characteristics may be related to drought sensitivity, because the vulnerability of the xylem conduits to hydraulic failure depends on lumen diameter and length as well as on cell wall thickness [55].
In general, our results showed positive genetic correlations between WD and δ 13 C, with the highest correlations for CALL and REDE (average across the two relationships of 0.23 and 0.30, respectively). Previous studies in Fagus sylvatica [93], showed that the phenotypic relationship between WD and δ 13 C differed between dry and wet years across sites. For wet years, WD and δ 13 C was negatively correlated and, in dry years δ 13 C increased with increasing wood density (i.e., positive correlation). Therefore, the higher values observed in the mentioned sites probably are associate with dryer environments. This conclusion can explain the results obtained for CALL and REDE as they are at lower elevation and are drier sites as compared to CARS, located at a higher elevation with relatively moist conditions (Table 1).
Resistance and Sensitivity drought indices were marginally negatively or positively correlated with δ 13 C, respectively, suggesting that trees most resistant to a drought event have low WUE (i.e., low δ 13 C values), at least in CARS and REDE, the sites with the highest correlation values (average across relationships, -0.28 and -0.35, respectively). Jucker et al. [86] showed that δ 13 C values provided a reliable and powerful indicator of drought across a wide range of forest tree species growing in different environmental conditions. As stated above, they did not find enough evidence to suggest that the increase in δ 13 C was associated with the significant decline in stem growth; however, they showed a clear association between increased δ 13 C and decreased growth under drought conditions in four sites along a Picea abies latitudinal gradient (-31.7% on average, see Fig 2 in Jucker et al. [86]), confirming our results.
There is evidence that terpenes are important components of conifer defenses [94,95]. It has been shown that some types of stress conditions, such as drought or temperature fluctuation enhance or inhibit the production of terpenes, modify their emission pattern or/and quantity [66]. Thus, the effect of abiotic stress on monoterpenes could explain the different responses across the study sites. For instance, the most resistant trees to drought stress showed low and negative correlations with all the monoterpenes studied at CALL (average correlations across monoterpenes and relationships was -0.25) and positive at CARS (average correlations across monoterpenes and relationships was 0.41). The correlation with the α-pinene concentration (a foliar protectant against Choristoneura fumiferana feeding [42]) was the most negative at CALL (-0.34) and the most positive at CARS (0.58). Moreover, it has also been demonstrated that protective compounds produced by plants subjected to biotic stress may enhance their tolerance to abiotic stress [66], the so called "cross-talk" between biotic and abiotic stress responses [96].
Finally, our study also compared the genetic correlation estimates between traits using the classical infinitesimal model from the pedigree information with those estimates from the genomic information. From theory, standard pedigree-based linear models capture expected genetic covariation, whereas marker-based models capture genetic covariation that is marked by SNPs [97]. Therefore, it is expected that for some of these traits, the estimated correlations may depend on the type of information. Our results showed that genome-based correlations generally reaffirm the pedigree-based correlations, but some pairs of traits disagree, either with missing correlations (i.e., the pedigree estimates were higher than those from SNP markers) or excessive correlations (i.e., the SNP markers estimates were higher than those from pedigree) (S4 Fig). Momen et al. [97] highlighted that some care should be taken when interpreting and using genetic parameters estimated via molecular markers, as predictions for complex traits based on pedigree data may differ from those based on SNP data, simply due to chance or other reasons, such as the extent of linkage disequilibrium (LD) between markers and the unknown quantitative trait loci (QTL). To potentially capture parts of the genetic covariance among traits that are not accounted for by either pedigree or genomic information alone, we recommend combining the pedigree and genomic information using the single-step GBLUP approach that combines pedigree and genomic relationship matrix [98] as applied to white spruce [3,14] and lodgepole pine [2].

Genetic-environmental correlations
The availability of multi-environmental forest genetics trials makes it feasible to evaluate both the magnitude and importance of the genotype by environment (G×E) interactions [99]. When these interactions are high (genetic correlations < 0.70), breeders must decide whether to select for performance stability and accept a slower rate of population improvement or to develop populations specifically adapted to each environment for gain maximization, however, the latter strategy is usually associated with greater program costs [100].
Despite being in the same breeding region (D1), the climatic conditions varied across the test sites, with the mean annual temperature (MAT) and precipitation (MAP) ranging from 1.3 to 2.9˚C and 442 to 535 mm during the trial period 1986-2019 period, respectively (Table 1). Among the test sites, CARS is higher in elevation, with the highest MAT and lowest mean warmest month temperature (MWMT; i.e., coolest summer), and highest annual precipitation and moisture (see Table 1), while the lower elevation CALL and REDE sites experienced warmer summers, and had lower annual precipitation and moisture index. Overall, these climate differences between CARS and both CALL and REDE sites might explain the high G×E interactions observed for the site-to-site pairs involving CARS, while the climate similarity between CALL and REDE may explain the low G×E interactions between these two sites (Fig 4 and S2 Table), in spite of the large geographic distance between them (Fig 1). It should also be mentioned that CALL and REDE were attacked by white pine weevil (Pissodes strobi Peck), a pest that destroys the leading shoot growth.
For HT and DBH, higher G×E interactions were observed for the analyses involving CARS (Fig 4 and S2 Table), suggesting selection for growth at CARS should be considered independently for its unique climate, as well as the absence of damage by white pine weevil. In contrast, for wood quality traits, such as WD and MFA, our results indicated a neglectable G×E effect. Similar results have previously been reported for several conifer species [2,[101][102][103][104][105]. These studies revealed that G×E interaction for WD is not very important (lodgepole pine: Ukrainetz and Mansfield [2]  The importance of examining the genetic variation in drought resilience across a range of extreme climate events and across sites has been emphasized [54]. However, to date, most studies have focused on single test sites [21,55,56]. Our findings showed high variability in genetic correlations between study sites with relatively large standard errors for the drought response indices, except for mean drought sensitivity (Sensitivity) between CALL and REDE. We have concluded that selection for drought resistant genotypes can only be made for sites with similar climate indices (such as the CALL and REDE in this study). How forests and trees react to drought is complex and varies across stands, sites, regions, and continents depending on multiple factors including climate conditions [107]. Moreover, as we mentioned before, these differences are likely due to other microenvironmental factors that existed between these three white spruce test sites, as indicated by Zas et al. [54].
The small G×E interaction reported in our work for δ 13 C are in agreement, generally, with other previous conifer studies [61,108]. Johnsen et al. [61] showed no evidence for a G×E interaction for foliar carbon isotope discrimination in Picea mariana. Guy and Holowachuk [108] reported no significant G×E interactions for δ 13 C in 10 lodgepole pine provenances tested on three sites in British Columbia (Canada) with contrasting soil moisture and climate. However, Baltunis et al. [59] observed a lower value of Type B total genetic correlation (0.64) in 1,000 Pinus taeda cloned full-sib families tested on two contrasting sites. Cregg et al. [109] also observed strong G×E interaction for stable carbon isotope discrimination in mature Pinus ponderosa at two contrasting locations in the Great Plains (USA), caused by growth phenology variation among seed sources. Information on G×E interaction for δ 13 C of white spruce field trials is extremely limited.
Finally, in agreement with two previous studies [110, 111] we observed, with only a few exceptions, low G×E interactions in the monoterpene compounds and total monoterpenes. Few studies have investigated G×E interaction of monoterpenes, probably, as stated by Ott et al. [111], due to the need for spatially replicated field trials of trees with known pedigree that are of an appropriate age for biotic challenges of interest. Hanover [110] showed that five cortical monoterpene concentrations (four used in this study) were quite stable across three clonal Pinus monticola trials established in contrasting sites in Idaho (USA). Ott et al. [111] also found that only a few monoterpene compounds from phloem tissue showed significant family × environment interactions in two OP progeny trials of lodgepole pine established in north central British Columbia. Based on these results, we can conclude that monoterpene compounds found in needles are relatively independent of climate and site characteristics, at least, within the studied D1 white spruce breeding region.

Implications for white spruce breeding
The ultimate goal of forest tree breeding and testing programs is to evaluate parents and their offspring across multiple sites, and for a reasonable duration to make reliable selection for the next breeding cycle. These efforts allow for the establishment of seed orchards for the reliable and abundant production of improved seed needed for reforestation programs today, and typically for the life of the orchard. Here, we tested and genotyped 80 of 150 families from a white spruce breeding population planted on three sites within one breeding zone for multiple traits including growth, wood quality, drought resistance, and chemical compounds associated with biotic and abiotic resistance using genomic (SNPs) and pedigree derived relationships. We compared the target attributes' genetic parameters (estimates of heritability, genetic correlation, and G × E interactions) using the two relationship methods to reach a reliable conclusion on selecting the appropriate genetic evaluation method as well as understanding the interplay among the selection traits to allow for more rapid evaluation without compromising the selection accuracy. The choice of selection attributes is of vital importance considering the time and effort needed for phenotypic traits evaluation, understand the correlated responses among the target traits, and finally the G × E interactions. In this regard, a number of findings can be made based on the results of this study aiming at improving white spruce breeding efforts challenged with the need to increase the scope of selections attributes and target deployment environments. The key findings include the following: a) use of δ 13 C as a relatively easy to measure trait and is an excellent proxy to WUE and growth rate, b) use of secondary chemical compounds (monoterpenes) as an indicator of a selected trees propensity to show insect and/or drought resistance, c) while the G-matrix provided better genetic parameter estimates than the A-matrix, the inconclusiveness of the former in some cases indicated that a blind approach (i.e., single-step GBLUP approach) of these relationships would be best, d) the existence of positive and negative genetic correlations among the studied traits cannot be overlooked during selection, e) the unfavourable relationship between growth and wood quality traits with drought resistance indices (negative correlations), indicating the importance of proper trait(s) choice for selecting under expected increasing drought environment with climate change, f) the value of chemical compounds "cross-talk" as an indicator for tolerance to biotic and abiotic stress, g) the magnitude and trajectory of G × E interaction as it determines the selection strategy (i.e., specialists vs. generalists), which is essential for seed orchards establishment, and h) the value of multiple site testing, especially for drought resistance, as variability among testing sites provide insight into site differences even if they are within one breeding zone. We believe that the lessons learned from this study will provide valuable information in the future selection and breeding of the white spruce population in Alberta and elsewhere.  Table. Estimated genetic correlations (and approximate standard errors) between the different traits from the multiple-trait analysis using the pedigree-(A-matrix, above diagonal) and genomic-based (G-matrix, below diagonal) relationship matrices for white spruce in each of the three test sites. Abbreviations used for the traits and sites are described, respectively, in the text and Table 1. Table. Estimated genetic correlations (and approximate standard errors) between the different sites from the multiple-site analysis using the pedigree-(A-matrix, above diagonal) and genomic-based (G-matrix, below diagonal) relationship matrices for white spruce in each of the three test sites. Abbreviations used for the traits and sites are described, respectively, in the text and Table 1